DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation